AITopics | building data pipeline

Collaborating Authors

building data pipeline

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

The Data Engineering Pipeline

#artificialintelligenceJul-4-2022, 16:20:10 GMT

Originally published on Towards AI the World's Leading AI and Technology News and Media Company. If you are building an AI-related product or service, we invite you to consider becoming an AI sponsor. At Towards AI, we help scale AI and technology startups. Let us help you unleash your technology to the masses. Data Engineers are at the heart of the engine room of any data-driven company.

data pipeline, etl pipeline, pipeline, (11 more...)

#artificialintelligence

Technology:

Information Technology > Data Science > Data Integration (0.81)
Information Technology > Artificial Intelligence > Representation & Reasoning > Information Fusion (0.81)

Add feedback

Learn the best practices for building data pipelines

#artificialintelligenceJun-28-2021, 01:06:04 GMT

According to Oracle, feature extraction is an attribute reduction process, which results in a much smaller and richer set of attributes. Depending on the requirements, identifying and extracting informative and compact data sets (for an ML model) may need structured data like numbers and dates or unstructured data like categorical features and raw text. If the data volume is large, the feature extraction can be handled separately, and the generated features can be stored in the storage layer. The format of the stored features is ready for direct consumption by the ML training process in the next phase. The feature extraction can be done for a wide range of applications like simple ETL process, model prediction pipeline, or retraining the model based on new data to improve the model accuracy.

building data pipeline, feature extraction, pipeline, (4 more...)

#artificialintelligence

Technology:

Information Technology > Data Science > Data Mining (0.89)
Information Technology > Artificial Intelligence > Machine Learning (0.66)
Information Technology > Data Science > Data Integration (0.60)
Information Technology > Artificial Intelligence > Representation & Reasoning > Information Fusion (0.60)

Add feedback

The Key to Building Data Pipelines for Machine Learning: Support for Multiple Engines - NASSCOM Community The Official Community of Indian IT Industry :))iiğ

#artificialintelligenceMay-18-2020, 16:40:18 GMT

As a consumer of goods and services, you experience the results of machine learning (ML) whenever the institutions you rely on use ML processes to run their operations. You may receive a text message from a bank requiring verification after the bank has paused a credit card transaction. Or, an online travel site may send you an email that offers personalized accommodations for your next personal or business trip. The work that happens behind the scenes to facilitate these experiences can be difficult to fully realize or appreciate. An important portion of that work is done by the data engineering teams that build the data pipelines to help train and deploy those ML models.

artificial intelligence, data mining, pipeline, (18 more...)

#artificialintelligence

Industry: Banking & Finance > Credit (0.55)

Technology:

Information Technology > Data Science > Data Mining > Big Data (1.00)
Information Technology > Artificial Intelligence (1.00)

Add feedback

Building Data Pipelines with Teraport for Feature Engineering - Loominus

#artificialintelligenceOct-4-2019, 21:26:10 GMT

Data pipelines are where most of the time is spent for those working with data because the bulk of a machine learning project involves data collection and cleaning. Loominus gives everyone the power to build the data pipelines critical to any machine learning project. Teraport is a powerful tool within the Loominus product suite that ingests and stages data. In another post, we'll discuss the data ingestion APIs. We're going to build a data pipeline that generates the average credit score of borrowers within a portfolio of loans.

data pipeline, pipeline, teraport, (13 more...)

#artificialintelligence

Industry: Banking & Finance (0.49)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback